[Picture Gallery]
 

"Localization and Khmer Language Processing"
   
Overview

Local language computing in Cambodia began primarily with the development of local language fonts for Microsoft and open source platforms.  Since then, efforts to develop local language applications have been slowly underway.  Recently, with the inclusion of Khmer script in Unicode, local language computing in Cambodia has been progressing rapidly.

Through the PAN Localization project, the Cambodia country component aims to develop a few basic to intermediate local language applications in Khmer.  The Cambodia country team of the PAN Localization project started working on localized technology with limited expert resources for developing such technology.  The team comprised fresh computer science graduates with limited to no experience in developing local language applications.

   
Objectives

 

The basic objective of this training was to train Cambodian human resources to develop localization applications and to help them successfully meet their planned deliverables. The team received training in basic to advanced programming and basic to advanced language processing techniques.  Broadly, the training comprised the following topics:

  • OpenType font development

  • Unicode language Processing

  • Lexicon Development

  • Project life cycle from design to execution and testing

  • Advanced programming in C++

  • Visual Basic.NET

The contracted project deliverables of the Cambodia country component for the PAN Localization project included:

  • Mapping from existing code charts to Unicode

  • Collation/Sorting sequence

  • Lexicon Development

  • Mobile Interface in Khmer

  • Terminology Translation for interface

  • Spell Check

Through the training, the country team was able to achieve the first three project deliverables within the first six months of the project's inception.

   
Challenges

The primary challenge in conducting this training was introducing localization to an audience that was completely unfamiliar with this kind of application development.  Even the most basic localization concept, Unicode handling, was unfamiliar to most of the team members.
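
As an illustration of what basic Unicode handling involves for Khmer, the following is a minimal sketch in Python. It is not taken from the training material; the sample word and the helper name `describe` are assumptions chosen for illustration. It shows that a Khmer word is stored as a sequence of Unicode code points, and that the number of code points differs from the number of bytes once the text is encoded as UTF-8.

```python
import unicodedata

# Sample word (an assumption for illustration): the Khmer word for "Khmer",
# written with explicit code point escapes so the source stays ASCII-only.
word = "\u1781\u17d2\u1798\u17c2\u179a"


def describe(text):
    """Return (code point, Unicode character name) pairs for each character."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch)) for ch in text]


# Number of Unicode code points in the word
code_points = len(word)

# Number of bytes needed to store the same word in UTF-8
utf8_bytes = len(word.encode("utf-8"))

for cp, name in describe(word):
    print(cp, name)
print(code_points, "code points,", utf8_bytes, "bytes in UTF-8")
```

Each Khmer character here occupies three bytes in UTF-8, so code that counts or slices bytes instead of code points can split a character in the middle. Handling of this kind underlies tasks such as collation and conversion of legacy documents.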

   
Discussion and Conclusion

 

The training was very successful, and many more advanced localization topics were covered than initially planned. The team members were trained so that they could carry on efficiently with the implementation of the remaining project deliverables. A delegation of IDRC personnel also visited the Cambodia country component office on 18 November 2004 and appreciated the project team’s work.

At the end of this training, a general workshop on localization technologies was also organized to share the project work with the general public. Presentations were delivered by the project team members.  The workshop was attended by distinguished representatives from different ministries and colleges.

Specifically, the topics covered during the workshop were:

  • Introduction to Khmer Standardization presented by Mr. Chea Sok Huor

  • Introduction to Unicode and OpenType Fonts presented by Mr. Atif Gulzar

  • Introduction to ScanFont, FontLab and Volt presented by Mr. Atif Gulzar

  • Khmer OpenType font development presented by Mr. Atif Gulzar

  • Introduction to Khmer Collation Sequence presented by Miss Ros Pich Hemy

  • Introduction to Khmer Lexicon Development presented by Mr. Chhoeun Tola

  • Conversion of non-Unicode documents to Unicode presented by Mr. Atif Gulzar

 

 
